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EXPRESSION OF HYDROPHOBIC PROTEINS 

Field of the Invention: 

This invention relates to the expression of non-native (ie heterologous) polypeptides 
5 which comprise a proportion of hydrophobic amino acids in an expression system such as 
a bacterial host (eg R coli). In particular, the invention, provides a method for designing 
polyepitope polypeptides with an increased probability of being efficiently expressed (ie in 
amounts detectable by SDS-PAGE). One particular application of the invention relates to 
the production of a polyepitope polypeptide comprising cytotoxic T-cell lymphocyte (CTL) 

10 epitopes from Epstein-Barr virus (EBV) for use in a polyepitope vaccine capable of eliciting 
a CTL immune response for the prevention of diseases associated with EBV (eg infectious 
mononucleosis (IM) and nasopharyngeal carcinoma (NPQ). Other particular applications 
relate to the production of polyepitope polypeptides suitable for use in polyepitope 
vaccines for preventing and/ or treating hepatitis C virus (HCV) and human 

15 immunodeficiency virus (HIV). . 

Background of the Invention: 

In order to maximise production of recombinant polypeptides in a bacterial host (eg 
E. coli), a number of parameters can be considered including factors affecting transcription 

20 (eg promoter choice, etc) and factors affecting translation mechanisms such as minimising 
the use of rare codons. However, these are unlikely to have an impact on the production of 
recombinant polypeptides comprising stretches of hydrophobic amino acids, which have 
traditionally proven difficult to produce in recombinant bacterial expression systems. 
Indeed, in the case of polypeptides comprising transmembrane sequences, the removal of 

25 these hydrophobic sequences generally improves yields of the recombinant molecule 
(Frace et al, 1999; Hobman et al, 1994; Polte et al, 1991; EMBL website-protein toxicity: 
www.embl-heidelberg.de / Externallnfo). 

The most likely reason for problems occurring in the production of foreign 
polypeptides possessing regions of hydrophobicity (particularly those with non-native 

30 sequences such as fusion proteins), is the post-translational association of nascent 

polypeptides with chaperone proteins such as R cotfgroEL. GroEL is involved in the 
refolding process of polypeptides emerging from the ribosome and proteins will recycle 
through the chaperone system until the correct conformation is achieved or the protein is 
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targeted for degradation. GroEL is known to bind hydrophobic amino acids and part of 
the refolding process is essentially to bury these hydrophobic sequences within the interior 
of the protein (Fisher and Yuan, 1994; Zahn and Pluckthun, 1994; Hayer-Hartl et al, 1994; 
Richarme and Kohiyama, 1994; Hendrick and Hard, 1995; Lin et al, 1995). 
5 Polyepitope or "polytope™" constructs (ie polypeptides comprising a tandem array 

of epitopes which may be contiguous or otherwise spaced apart by short intervening 
amino acid sequences of, for example, 1 to 5 amino acids in length), would be expected to 
be inherently unable to internalise any hydrophobic regions as they are not naturally- 
occurring sequences and lack the folding capabilities inbuilt in naturally-occurring 

10 proteins. Hence polypeptides which consist of non-native sequences, particularly those 
with a high proportion of . hydrophobic amino acids, are likely to be sequestered in the 
chaperone folding pathway and ultimately targeted for degradation if a certain degree of 
conformational stability cannot be achieved. 

Polyepitope vaccines typically comprise one or more polypeptides each made up of 

15 a tandem array of CTL epitopes. These CTL epitopes, particularly those of the HLA A2 
type, often comprise predominantly hydrophobic amino acids and since HLA A2 is 
represented in over 40% of the human population it is mandatory that these epitopes be 
included in any effective polyepitope vaccine formulation. 

Examples of polyepitope vaccines are described in Australian Patent No. 736336, 

20 the entire disclosure of which is to be regarded as being incorporated herein by reference. 
In this patent, vaccines are described which comprise a synthetic or recombinant 
polypeptide, or a recombinant vaccinia virus or DNA vaccine encoding same, wherein the 
synthetic or recombinant polypeptide typically comprises a tandem array of CTL epitopes 
(eg a tandem array of 2 to 10 CTL epitopes) wherein at least two of the CTL epitopes are 

25 contiguous or spaced apart by intervening sequences in which the intervening sequences 
do not comprise any substantial lengths of naturally occurring flanking sequences of the 
epitopes. Particularly described in the prior patent are vaccines comprising a polyepitope 
vaccinia virus encoding a polyepitope polypeptide comprising nine CTL epitopes (each 
CTL epitope being of 9 to 10 amino acids in length) from EBV. Standard chromium release 

30 assays conducted with this virus in a panel of target cells expressing the HLA alleles for 
restriction of each epitope and using autologous CTL clones specific for each epitope as 
effector cells, showed that each epitope could be efficiently processed from the polyepitope 
polypeptide since, in all cases, the CTL clones recognised and killed the HLA matched 
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target cell infected with the poly epitope vaccinia Virus, but did not kill any of the negative 
controls (ie TK-vaccinia). 

Further examples of polyepitope vaccines are described in International patent 
specification WO 0i/ 47541, the entire disclosure of which is to be regarded as being 
incorporated herein by reference. In this specification, vaccines are described which 
comprise multiple HLA epitopes wherein the multiple HLA epitopes have been sorted so 
as to minimise the number of "junctional epitopes' 1 (ie epitopes inadvertently created by the 
juxtaposition of two other epitopes) and wherein flanking amino acid residues are 
introduced wherever junctional epitopes are unavoidable. 

Polyepitope vaccines may include a large number of CTL epitopes (eg 10 or more) 
so that the HLA diversity of the target population is covered. It has therefore been 
contemplated by the present applicants to produce an EBV polyepitope vaccine which 
includes EBV epitopes restricted by HLA A2, A3, All, A23, A24, B7, B8, B27, B35, B44, B46, 
B57, B58, B60 and B62, so as to provide protection against EBV in over 90% of the human 
population. This would involve the incorporation of about 26 EBV CTL epitopes into a 
polyepitope polypeptide. For the reasons given above, it was expected that such a 
polypeptide would contain hydrophobic regions and that expression in a suitable host 
could be highly problematical. 

The work leading to the present invention was aimed at elucidating a method or 
procedure for overcoming the difficulties of expressing non-native polypeptides which 
comprise a proportion of hydrophobic amino acids (eg polyepitope polypeptides) in a 
bacterial host such as B. colL The present applicants have, as a result of that work, 
identified a novel method for designing candidate polyepitope polypeptides with an 
increased probability of being efficiently expressed in a bacterial host and /or yielding a 
purified polyepitope polypeptide which is soluble in aqueous solutions. The identification 
of this method arose out of a recognition of a need for individual epitopes to be arranged in 
a non-random way within a polyepitope polypeptide so that regions of hydrophobicity are 
distributed more evenly throughout the molecule rather than clustered in one or more 
particular regions. The novel method therefore involves identifying one or more 
hydrophobic peptide s equences within a polypeptide and arranging or re-locating at least 
one of the hydrophobic peptide sequence(s), so as to; (a) reduce or minimise amplitude (ie 
peaks) in hydrophobicity across the length of the polypeptide, and /or (b) reduce or 
minimise the total length of any hydrophobic region(s) within the polypeptide. To assist in 
the utility of this method, the present applicants have also identified an algorithm which 
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permits any number of epitopes to be readily ordered into a polyepitope polypeptide 
sequence lacking or having reduced regions of relative high hydrophobicity. 

Summary of the Invention: 

5 In a first aspect, the present invention provides a method for designing a candidate 

polypeptide for expression in a prokaryotic or eukaryotic host, said method comprising, 
identifying one or more hydrophobic peptide sequences within a polypeptide of 
interest and 

arranging or re-locating at least one of said hydrophobic peptide sequences within 
10 said polypeptide so as to generate said candidate polypeptide with reduced amplitude in 
hydrophobicity and/ or length of any hydrophobic region(s). 

Preferably, the polypeptide of interest is non-native to the intended host Since the 
most preferred host is R coli, most preferably the polypeptide is non-native to E. colL 

The polypeptide of interest will preferably be a non-natural polypeptide or even a 
15 theoretical non-natural polypeptide (ie a polypeptide yet to be synthesised or expressed) 
comprising a plurality of amino acid sequences of interest some of which may be 
hydrophobic or suspected to be hydrophobic, and which has been found not to be, or is 
suspected not to be, efficiently expressed in said host. For such a polypeptide of interest, 
the method of the first aspect provides the possibility of identifying one or more 
20 hydrophobic peptide sequences, if any, within the polypeptide of interest and arranging or 
re-locating at least one of the hydrophobic peptide sequence(s) so as to generate a 
candidate polypeptide with reduced amplitude in hydrophobicity and /or length of any 
hydrophobic region(s), and therefore an increased probability of being efficiently expressed 
in a suitable host. 

25 Preferably, the polypeptide of interest may be a synthesised or theoretical 

polyepitope polypeptide comprising a tandem array of epitopes of interest (eg CTL 
epitopes, which, as is mentioned above, often predominantly comprise hydrophobic amino 
acids). For such a polypeptide of interest, the method of the first aspect permits the design 
of candidate polyepitope polypeptides comprising a large number of epitopes of interest 

30 (eg 5 to 100 or more) with an increased probability of being efficiently expressed in a 

suitable host, by enabling the possibility of identifying one or more hydrophobic epitopes 
and arranging or re-locating at least one of the hydrophobic epitope(s), so as to generate a 
candidate polyepitope polypeptide with reduced amplitude in hydrophobicity and /or 
length of any hydrophobic region(s). 
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It has been found that the method of the first aspect is best applied to the design of 
a candidate polyepitope polypeptide in a manner which identifies and ranks the relative 
hydrophobicity of each of the selected epitopes (nb The epitopes of interest may be a.range 
of epitopes from a single pathogen (eg EBV) selected to provide a polyepitope polypeptide 
5 that covers the HLA diversity of the target population. The epitopes of interest may also be 
one or more epitopes from a range of pathogens or the epitopes may be derived from a 
non-microbial source such as a tumour cell for treating or preventing cancer.), groups the 
ranked epitopes into three groups of substantially equivalent numbers, based upon the 
identified relative hydrophobicity (ie so as to produce the groups, Group 1 = most 

10 hydrophobic, Group 2 = middle hydrophobicity, and Group 3 « least hydrophobic and 
"residual" epitopes where the total number of epitopes (N) is not wholly divisible by 3), 
and then arranges the epitopes into triplets where the triplets contain an epitope from each 
group (ie three linked epitopes; epitope 1 - epitope 2 - epitope 3) and arranged into a 
candidate polyepitope polypeptide having the formula, Triplet 1 - Triplet 2 - .... - Triplet 

15 N/ 3, as follows: 





Epitope 1 


Epitope 2 


11 

Epitope 3 


Triplet 1 (N-terminal) 


Most hydrophilic of 
Group 2 


Most hydrophobic of 
Group 1 


Most hydrophilic of 
Group 3 


Triplet 2 


2 nd most hydrophilic 
of Group 2 


2 nd most hydrophobic 
of Group 1 


2 nd most hydrophilic 
of Group 3 










Triplet N/3 (C- 
terminal) 


Most hydrophobic of 
Group 2 


Most hydrophilic of 
Group 1 


Most hydrophobic of 
Group 3 



(Any "leftover 1 ' epitope(s) (ie least hydrophilic epitope(s) of Group 3) may be added to the C-termina] of Triplet N/3, or 
otherwise may be located within the candidate polyepitope polypeptide sequence so as to reduce any local peaks of 
20 hydrophobicity.) 

Between the epitope triplets, or between any or all of the epitopes within a triplet, 
there may be intervening sequences (preferably short sequences of 1 to 10 amino acids) 
which may optionally be hydrophilic (eg lysine-lysine) so as to reduce any local peaks of 
25 hydrophobicity. Preferably, the epitopes within a triplet are contiguous. 

Other simple methods for arranging the epitope(s) so as to minimise extremes in 
hydrophobicity in a polyepitope polypeptide will be readily apparent to persons skilled in 
the art, and are to be considered as forming part of the present invention. For example, in 
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one variation of the above method, and referring again to Table 1, epitope 1 would instead 
be selected using the stated criteria for epitope 2, epitope 2 would instead be selected using 
the stated criteria for epitope 3, and epitope 3 would instead be selected using the stated 
criteria for epitope 1. In another variation, the epitopes would be selected from four 
groups of ranked epitopes and consequently arranged into sets of 4 epitopes. 

As mentioned above, the method of the first aspect generates a candidate 
polypeptide with reduced amplitude in hydrophohicity and/ or length of any hydrophobic 
region(s). In the context of applying the method to a natural polypeptide, "reduced 
amplitude in hydrophobicity" is to be understood to mean that any peaks of 
hydrophobicity of the candidate polypeptide (ie as may be calculated /measured using 
Pinsoft 2 from Mimotopes Pty Ltd, Melbourne, Australia) is reduced relative to the natural 
polypeptide, and that "reduced length of any hydrophobic region(s)" is to be understood to 
mean that the length of amino acid sequence of any hydrophobic region(s) in the candidate 
polypeptide is/ are reduced relative to the natural polypeptide. In the context of applying 
the method to a non-natural polypeptide (including a theoretical non-natural polypeptide), 
"reduced amplitude in hydrophobicity" is similarly to be understood to mean that any 
peaks of hydrophobicity of the candidate polypeptide (ie as may be calculated /measured 
using Pinsoft 2 from Mimotopes Pty Ltd) is reduced relative to the non-natural 
polypeptide, and that "reduced length of any hydrophobic region(s)" is to be understood to 
mean that the length of amino acid sequence of any hydrophobic region(s) in the candidate 
polypeptide is/ are reduced relative to the non-natural polypeptide. In the context of 
applying the method to the more specific non-natural polypeptide example of a 
polyepitope polypeptide, "reduced amplitude in hydrophobicity" is to be understood to 
mean that any peaks of hydrophobicity of the candidate polypeptide (ie as may be 
calculated using the mathematical expression described below) is reduced relative to most 
of the possible random arrangements of the epitopes comprising the polyepitope 
polypeptide, and that "reduced length of any hydrophobic region(s)" is to be understood to 
mean that the length of amino acid sequence of any hydrophobic region(s) in the candidate 
polypeptide is/ are reduced relative to most of the possible random arrangements of the 
epitopes within the polyepitope polypeptide. 

Once a candidate polypeptide has been designed in accordance with the method of 
the first aspect, a polynucleotide encoding the candidate polypeptide may be synthesised 
according to any of the methods well known to persons skilled in the art. The encoding 
polynucleotide may be incorporated into, for example, vectors such as viral vectors (eg 
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vaccinia to provide a recombinant polyepitope viral vaccine) or expression vectors such as 
those suitable for expression in a suitable host. 

Thus, in a second aspect, the present invention provides a method of expressing a 
polypeptide in a suitable host, said method comprising, 
5 designing a polypeptide in accordance with the method of the first aspect, 

introducing a polynucleotide encoding said polypeptide into said host, such that 
said host is capable of expressing said polypeptide, and 

culturing said host under conditions suitable for expression of said polypeptide. 

The expressed polypeptide may be isolated by, for example, lysing the host cell and 
10 purifying the polypeptide from the produced cell lysate. 

The polynucleotide introduced into the host cell may encode the polypeptide in the 
form of a fusion of the polypeptide with a suitable carrier protein. Alternatively, the 
polypeptide could be expressed and subsequently linked to or otherwise associated with a 
suitable carrier protein. Suitable carrier proteins are well known to persons skilled in the 
15 art and include (3-galactosidase, glutathione S-transferase and the gp350 structural protein 
from EBV or a fragment thereof. The carrier protein may comprise additional useful 
epitopes. Further increases in expression benefits provided by ordering may be conferred 
by the carrier protein. 

In a third aspect, the present invention provides a polypeptide designed in 
20 accordance with the method of the first aspect. 

If desired, the polypeptide of the third aspect may be in the form of a fusion of the 
polypeptide with a suitable carrier protein. 

In a fourth aspect, the present invention provides a polyepitope polypeptide 
designed in accordance with the method of the first aspect. 
25 If desired, the polypeptide of the fourth aspect may be in the form of a fusion of the 

polypeptide with a suitable carrier protein. 

In a fifth aspect, the present invention provides a polyepitope polypeptide 
comprising N epitopes, wherein N is any integer (preferably an integer in the range of 5 to 
100, and more preferably, 10 to 35), said polyepitope polypeptide having the formula; 
30 Triplet 1 - Triplet 2 - .... - Triplet N/3, 

wherein each of said triplets comprises three linked epitopes selected by, 

identifying and ranking the relative hydrophobicity of each of the N epitopes, 

grouping the ranked N epitopes into three groups of substantially equivalent 
numbers, based upon the identified relative hydrophobicity of the N epitopes, to produce a 
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first group (ie Group 1) comprising the most hydrophobic epitopes, a second group (ie 
Group 2) comprising the epitopes having a middle level of hydrophobicity, and a third 
group (ie Group 3) comprising the least hydrophobic epitopes, and 

selecting the epitopes for each of said triplets according to the following table: 

5 





Epitope 1 


Epitope '2 


Epitope 3 


Triplet 1 (N-terminal) 


Most hydrophilic of 
. Group 2 


Most hydrophobic of 
Group 1 


Most hydrophilic of 
Group 3 


Triplet 2 


. 2 nd most hydrophilic 
of Group 2 


2 nd most hydrophobic 
of Group 1 


2 nd most hydrophilic 
of Group 3 










Triplet N/3 (C- 
terminal) 


Most hydrophobic of 
Group 2 


Most hydrophilic of 
Group 1 


Most hydrophobic of 
Group 3 



Preferably, the first, second and third groups comprise identical numbers of 
epitopes. Where N is an integer not wholly divisible by 3 (ie an integer other than, for 
example, 6, 9, 12, 15, and 18), then the residual epitopes are preferably included within the 
10 third group. 

At the end of the step of selecting epitopes for each of said triplets, if there is /are 
any leftover epitope(s) (ie least hydrophilic epitope(s) of Group 3), then this/ these may be 
added to the C-terminal of Triplet N/3, or otherwise may be located within the candidate 
polyepitope polypeptide sequence so as to reduce any local peaks of hydrophobicity. 

15 Between the epitope triplets, or between any or all of the epitopes within a triplet, 

there may be intervening sequences (preferably short sequences of 1 to 10 amino acids) 
which may optionally be hydrophilic (eg lysine-lysine) so as to reduce any local peaks of 
hydrophobicity, or otherwise avoid the creation of junctional epitopes. Preferably, the 
epitopes within a triplet are contiguous. 

20 If desired, the polyepitope polypeptide of the fifth aspect may be in the form of a 

fusion of the polyepitope polypeptide with a suitable carrier protein. 

In a sixth aspect, the present invention provides a polypeptide vaccine comprising a 
polyepitope polypeptide according to the fourth or fifth aspect and a pharmaceutical^ 
acceptable carrier and/ or adjuvant. 

25 In a seventh aspect, the present invention provides a polyepitope polypeptide 

comprising an amino acid sequence substantially corresponding to an amino acid sequence 
selected from the group consisting of: 
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FLRGRAYGL - PYLFWLAAI - HRCQAIRKK - KRIYDLIEL - VQPPQLTLQV- 
GLCTLVAML - RLRAEAQVK -IEDPPFNSL -YLLEMLWRL - GQGGSPTAM - 
AVLLHEESM - IALYLQQNWWTL-RAKFKQLL - SSCSSCPLSKI- TYGPVFMCL- 
5 QAKWRLQTL - RPPEFIRRL- VSFIEFVGW - YPLHEQHGM -VEITPYKPTW - 

CLGGLLTMV - EENLLDFVRF - TYSAGIVQI - LLDFVRFMGV -EGGVGWRHW 
(SEQIDNO:l), 

FLRGRAYGL - PYLFWLAAI - HRCQAIRKK - RRIYDLIEL - GLCTLVAML - 
RLRAEAQVK - IEDPPFNSL -TYSAGIVQI - LLDFVRFMGV - EGGVGWRHW - 
IALYLQQNWWTL - RAKFKQLL - SSCSSCPLSKI - TYGPVFMCL - QAKWRLQTL- 
RPPIFIRRL - VSFIEFVGW - YPLHEQHGM - VEITPYKPTW - CLGGLLTMV - 
EENLLDFVRF - YLLEMLWRL - GQGGSPTAM - AVLLHEESM - VQPPQLTLQV 
(SEQIDNO:2), 

SSCSSCPLSKI - HRCQAIRKK - CLGGLLTMV - LTAGFLIFL - RLRAEAQVK - 
IEDPPFNSL - LLSAWILTA - RRRWRRLTV - PYLFWLAAI - YLLEMLWRL - 
GQGGSPTAM - VMSNTLLSAW - ALLVLYSF A - RAKFKQLL - IALYLQQNW - 
TYGPVFMCL - QAKWRLQTL - YLQQNWWTL - YPLHEQHGM - CPLSKILL 
(SEQIDNO:3), 

IPTVAIVALV - RLRPGGKKK - ILKEPVHGV - PLVKLWYQL - RPGGKKKYKL - 
KYKLKHTVW - TWETWWTEYW - EIKDTKEAL - KRWIILGLNK - 
KLWVTVYYGV - KIEELRQHL - MTNNPPDPV - VTLWQRPLV - WASRELERF - 
25 LLWKGEGAV - YTAFTIPSI - P/QEPFKNLK - SLYNTVATL - AURILQQL - 

AIFQSSMTK - VIYQYMDDL - LVGPTPVNI - TPQDLNTML - YLAWVPAHK - 
ALVEICTEM - TLNAWVKW (SEQ ID NO:4), 

and 

30 

LLFNILGGWV - KTSERSQPR - FLLLADARV - LLFLLLADA - RLGVRATRK - 
GVAG ALVAFK - LPGCSFSIF - RMYVGGVEHR - VAGALVAFK - DLMGYIPLV - 
LEFCHSKKK - ILAGYGAGV - HMWNFISGI - QLFTFSPRR - VGTrtXPNR - 
FWAKHMWNF - YLVTRHADV - LSAFSLHSY - WMNRLIAFA - YLLPRRGPRL - 



10 



15 



20 
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YLVAYQATV - RLIVFPDLGV - TLGFGAYMSK - IPFYGKAJ - VLVGGVLAA - 
CTCGSSDLY (SEQ ID NO:5). 

In an eighth aspect, the present invention provides a polyepitope vaccine 
comprising a polyepitope polypeptide according to the seventh aspect and a 
pharmaceutical^; acceptable carrier and/ or adjuvant. 

Detailed Description of the Invention: 

The present applicants have identified novel methods for designing a candidate 
polyepitope polypeptide, with an increased probability of being efficiently expressed in a 
prokaryotic or eukaryotic host (ie in amounts detectable by SDS-PAGE). The method 
involves identifying one or more hydrophobic epitope(s) and arranging or re-locating at 
least one of the hydrophobic epitope(s) so as to generate a candidate polyepitope 
polypeptide with reduced amplitude in hydrophobicity and /or length of any hydrophobic 
region(s): 

An algorithm to calculate hydrophobicity values of amino acid sequences and 
subsequently arrange sequences to; (a) reduce or minimise amplitude in hydrophobicity, 
and/ or (b) reduce or minimise the length of hydrophobic sequences, was generated and 
initially applied to 26 CTL epitope sequences from EBV. This resulted in the design of two 
initial candidate polyepitope polypeptides (designated PT26A and PT26B, described 
hereinafter), one of which proved to be efficiently expressed in B. coll The expressed 
polyepitope polypeptide shows promise as the basis of an EBV vaccine for prevention or 
treatment of infectious mononucleosis and/ or EBV-related cancers such as Burkitts 
lymphoma, Hodgkin's disease, non-Hodgkin's lymphoma, nasopharyngeal carcinoma, 
gastric adenocarcinoma, lymphomas associated with immunosupression, 
lymphoepithelioma-like carcinomas, and immunodeficiency-related leiomyosarcoma. 

While looking for an explanation as to why the expression capabilities of R coliior 
the two similar candidate polyepitope polypeptides were different, summations of 
hydrophobicity values (designated Hydrophobic Index (HI) values) were calculated for 
different numbers of epitopes over the length of the candidate polyepitope polypeptides to 
identify local areas of hydrophobicity. Summation over 3 and 4 epitopes showed that there 
were regions in the non-expressed polypeptide where the HI value was higher than in the 
expressed polypeptide. This information enabled the identification of a threshold HI 
value, such that polypeptide sequences which comprised a region with an HI value in 



WO 2004/007556 



PCT/AU2003/000910 



11 

excess of the threshold value, could be predicted as being less likely of being efficiently 
expressed in a bacterial host. 

Thus, in a preferred embodiment of the methods of the present invention for 
designing candidate polypeptides, the methods involve initially calculating hydrophobicity 
5 values and arranging peptide sequences to; (a) reduce or minimise amplitude in 

hydrophobicity, and (b) reduce or minimise the length of hydrophobic sequences; and then 
"fine-tuning", if necessary, by calculating the HI values over different peptide sequence 
groups, thus providing numerical values for comparison and prediction of the likelihood of 
a candidate polypeptide sequence being efficiently expressed in a bacterial host. So, in 
10 applying this preferred embodiment to the design of a candidate polyepitope polypeptide, 
the method involves: 

(i) Calculating the hydrophobic value for each epitope using a suitable algorithm (eg 
Fauschere and Pliska, 1983 contained within the software package f, Pinsoft 2" from 
Mimotopes Pty Ltd). 

15 (ii) Ranking the set of epitopes in order of decreasing hydrophobicity. 

(iii) Dividing the rank ordered set of epitopes into a number of equal groups, (eg three 
equal groups wherein group 1 = most hydrophobic, group 2 = middle hydroplSbicity and 
group 3 = least hydrophobic (most hydrophilic)), and including any residual epitopes in 
the most hydrophilic group. 

20 (iv) Creating sets (eg triplets) of epitopes by taking, in for example a case where the 

epitopes have been divided into three groups, the most hydrophilic epitope of group 2 (ie 
last in group 2), then the most hydrophobic epitope (ie number 1 in group 1) and lastly the 
most hydrophilic epitope (ie last in group 3) until all epitopes in groups 1 and 2 have been 
used (nb "Leftover" epitopes are handled as set out in step (ix) below). 

25 (v) Arranging the sets of epitopes (eg triplets) into a sequence in the order in which 
they were produced (eg Triplet 1 - Triplet 2 - Triplet 3 - etc). 

(vi) Plotting the hydrophobicity of the arranged polyepitope polypeptide sequence 
using a suitable algorithm (eg Fauschere and Pliska, 1983, or Hopp and Woods, 1981). 

(vii) If necessary, reducing hydrophobic amplitude by re-locating sets of epitopes (eg 
30 triplets) from areas of low hydrophobicity into areas of high peak hydrophobicity and / or 

by re-locating individual hydrophobic (ie group 1) epitopes from areas of peak 
hydrophobicity into areas of low hydrophobicity. 

(viii) Re-calculating the hydrophobicity plots and continuing, if necessary, to shuffle sets 
of epitopes (eg triplets) as in step (vii) above to generate a final sequence arrangement. 
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(ix) Placing any leftover epitopes (eg least hydrophilic epitopes of group 3) at the C- 
terminal of the final sequence arrangement or other location so as to further reduce local 
peaks in hydrophobicity (ie by inserting them adjacent to epitopes of peak 
hydrophobicity). 

(x) Placing any affinity tags (usually hydrophilic, eg a hexa-histidine sequence) at 
either the N- or C- terminal of the final polyepitope polypeptide sequence or at the C- 
terminal if the final polyepitope polypeptide sequence is to be expressed as a fusion 
protein. 

The HI values may be calculated by using the mathematical expression: 

e-m+n-1 
HI rn = Zx e 
e=m 

(where m = group number, n = group size, e = integer (epitope) number, x = epitope 
hydrophobicity value). 

Preferably, the HI values are calculated using this mathematical expression when 
n=3 and n=4. In the examples provided hereinafter, this calculation predicted that to be 
able to express linked, random, short amino acid sequences in R coliin SDS-PAGE 
detectable amounts, the hydrophobic index over groups of three epitope sequences would 
need to be less than 1.8 (HI 3 < 1.8) and/ or that over groups of four epitope sequences, the 
. hydrophobic index would need to be less than 2.5 when x was calculated using Pinsoft 2 
(Mimotopes Pty Ltd) and specifying the N-terminus as N-acetyl and the C-terminus as 
carboxy amide. Different cut-off values will be obtained with different hydrophobicity 
algorithms. 

It will be readily appreciated that the calculation of HI values in this manner, would 
be useful for predicting whether a natural, non-bacterial polypeptide or a derivative 
thereof may be efficiently expressed in E. coll The present invention therefore further 
extends to a method of predicting efficient expression of a polypeptide in a suitable host 
(eg a bacterial host), involving calculating HI values in accordance with the mathematical 
expression: 

e=m+n-l 
HI m = 2 x e 
e==m 

(where m = group number, n = group size, e = integer (epitope) number, x = epitope 
hydophobicity value). 
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The methods of the present invention permit the design of candidate polyepitope 
polypeptides comprising a large number of epitopes of interest (eg 5 to 100 or more) with 
an increased probability of being efficiently expressed in a suitable host (eg a bacterial 
host). The epitopes of interest may be a range of epitopes from a single pathogen selected 
5 to provide a polyepitope polypeptide that covers the HLA diversity of the target 
population, or the epitopes of interest may be one or more epitopes from a range of 
pathogens or tumour antigens. As is evident from the above, one particular application of 
the methods of the present invention is to the design of candidate polyepitope 
polypeptides comprising 26 EBV CTL epitopes for use in a vaccine to provide protection 

10 against EBV in over 90% of the human population, or for treating diseases associated with 
EBV such as NPC Other particular applications of the methods of the present invention 
relate to the design of candidate polyepitope polypeptides suitable for use in polyepitope 
vaccines for preventing and /or treating HCV and HIV. Another particular application of 
the methods of the present invention is to the design of candidate polyepitope 

15 polypeptides comprising CTL epitopes from cytomegalovirus (CMV), for use in a vaccine 
to prevent or treat CMV-causative diseases. 

A candidate polypeptide designed in accordance with the methods of the present 
invention may be expressed by firstly synthesising a polynucleotide encoding the 
candidate polypeptide according to any of the methods well known to persons skilled in 

20 the art, and then by introducing the polynucleotide into a suitable host. Typically, this will 
be achieved by cloning the polynucleotide into an expression vector and then introducing 
the expression vector into said host by any of the transformation methods well known to 
persons skilled in the art. Expression from the expression vector may result in the 
polypeptide being expressed as a fusion protein comprising the polypeptide and a suitable 

25 carrier protein (eg (3-galactosidase, glutathione S-transferase or the gp350 structural protein 
from EBV or a fragment thereof). Alternatively, the polypeptide may be expressed by the 
host cell, and following isolation of the polypeptide, the polypeptide may be linked to or 
otherwise associated with a suitable carrier protein. The carrier protein may also confer 
additional useful properties (ie the carrier protein may comprise useful epitopes or 

30 sequences to enhance solubility, further enhance purification procedures, facilitate 
association with an adjuvant or to which an immune response is desirable). 

It is further contemplated that candidate polypeptides designed in accordance with 
the methods of the present invention may be expressed in prokaryotic expression systems 
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other than R coll Typical alternative systems include B. subtilis, Salmonella sp., 
Streptococcus sp v Lactobacillus sp v and Streptomyces sp. 

It is also contemplated that candidate polypeptides designed in accordance with the 
methods of the present invention may be readily expressed in whole cell lysates and non- 
5 bacterial host cells as well, and accordingly such alternative expression methods for 

candidate polypeptides are to be considered as forming part of the present invention. In 
particular, the present invention is to be considered as extending to a method of expressing 
a polypeptide in a non-bacterial host cell such as a mammalian cell (eg a CHO cell or COS 
cell line), a yeast cell (eg Saccharomyces cerevisiae) or insect cell (eg SF9 cell line), wherein 

10 the method comprises designing a polypeptide in accordance with the method of the first 
aspect, introducing a polynucleotide encoding the polypeptide into the host cell such that 
the host cell is capable of expressing the polypeptide, and culturing the host cell under 
conditions suitable for expression of the polypeptide. The expressed polypeptide may be 
isolated from the host cell culture by lysing the cells and purifying the polypeptide from 

15 the produced cell lysate, or alternatively, the polypeptide could be expressed with a 
suitable secretion signal such that the polypeptide is secreted into the culture medium 
(from where it may be purified). Designing a polyepitope polypeptide in accordance with 
the methods of the present invention may also overcome non-secretion problems which are 
sometimes experienced when a hydrophobic polypeptide is expressed with a foreign 

20 secretion signal. 

Where the expressed polypeptide is of pharmacological or veterinary significance, 
the polypeptide may be formulated into a pharmaceutical or veterinary composition. 
Generally, such compositions will comprise a pharmaceutically acceptable or veterinary 

acceptable carrier, and may include other substances and excipients as may be required. 

> 

25 Polyepitope polypeptides may be formulated into vaccine compositions. Generally, 

such compositions will comprise a pharmaceutically acceptable or veterinary acceptable 
carrier and may include adjuvants (eg an ISCOM™ adjuvant, DEAE, polysaccharides, 
saponins, liposomes and virus-like particles), and other substances and excipients as may 
be required. The vaccine compositions may also include helper epitopes /CD4 epitopes or 

30 B-cell epitopes. The vaccine compositions may be adapted for administration to a subject 
by, for example, intramuscular injection, nasal administration via an 'aerosol spray, or oral 
administration. Preferably, the vaccine compositions are ISCOM™ adjuvant compositions. 

Polyepitope polypeptides may also be administered to a subject in the form of a 
viral vaccine (eg a recombinant polyepitope vaccinia or adenovirus) or DNA vaccine. 
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Thus, in a further aspect, the present invention provides a polynucleotide vaccine 
comprising a polynucleotide encoding a polypeptide designed in accordance with the 
method of the first aspect, and a pharmaceutical^ acceptable carrier and /or adjuvant. 

Throughout this specification the word "comprise", or variations such as 
"comprises" or Comprising", will be understood to imply the inclusion of a stated element, 
integer or step, or group of elements, integers or steps, but not the exclusion of any other 
element, integer or step, or group of elements, integers or steps. 

The term "substantially corresponding" as used herein in relation to an amino acid 
sequence is intended to encompass the exact amino acid sequence as well as minor 
variations which do not result in a substantial decrease in the biological activity of the 
amino acid sequence (eg variations which do' not diminish the ability of an epitope to 
provoke a CTL immune response). These variations may include one or more conservative 
amino acid substitutions. The conservative amino acid substitutions envisaged are: G, A, 
V, I, L, M; D, E, N, Q; S, C, T; K, R, H; and P, Na-alkylamino acids. 

Any discussion of documents, acts, materials, devices, articles or the like which has 
been included in the present specification is solely for the purpose of providing a context 
for the present invention. It is therefore, not to be taken as an admission that any or all of 
these matters form part of the prior art base or were common general knowledge in the art 
relevant to the present invention as it existed in Australia or elsewhere before the filing or 
priority date of the present specification. 

The invention will hereinafter be further described by way of the following non- 
limiting examples and accompanying figures. 

Brief Description of the accompanying Figures: 

Figure 1 provides the epitope configuration and amino acid sequences for EBV- 
polyepitope polypeptides, PT26A and PT26B. Numbers above epitopes represent 
hydrophobicity values for each epitope calculated withPinsoft 2 software, specifying an 
acetyl N-terminal and an amide C-terminal. 

Figure 2 provides hydrophobicity plots of PT26A and PT26B. Hydrophobicity 
values of a moving nine amino acid window are derived using the algorithm of Fauchere 
and Pliska, 1983. 

Figure 3 provides the amino acid sequence of the fusion of residues 21 to 447 of EBV 
gp350 to PT26A (A) or to PT26B (B). 
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Figure 4 shows Coomassie stained SDS-PAGE gels showing the time course of 
expression following induction with EPTG (+IPTG or I) of: (A) PT26A (at approximately 
30kDa), (B) gp350/PT26A (at approximately 80kDa), and (C) PT26B expression. Arrows 
indicate the location of recombinant protein. 
5 Figure 5 provides ELISPOT assay results of CTL responses to the five HLA A2 

epitopes contained within the polyepitope polypeptides, PT26A and PT26B, under two 
formulation conditions: (A) lO^ig gp350-PT26A, (B) lO^ig gp350-PT26B, and (C) positive 
control peptide mix cpntaining a mixture of each of the 5 A2 epitopes contained in the 
polypeptide polypeptide. Each of the epitopes is represented below by their first 3 amino 
10 acids. The CTL response of each mouse JVQ-M5 to each A2 epitope is presented as a bar . 
indicating the number of IFN-y spots produced. 

Figure 6 provides the epitope configuration and amino acid sequences for an EBV 
polyepitope polypeptide, EBV-NPCa. Numbers above epitopes represent hydrophobicity 
values for each epitope calculated with Pinsoft 2 software, specifying an acetyl N-terminus 
15 and an amide C-terminus. 

Figure 7 provides the epitope configuration and amino acid sequences for HIV 
polyepitope polypeptides, HIVa and HIVb. Numbers above epitopes represent 
hydrophobicity values for each epitope calculated with Pinsoft 2 software, specifying an 
acetyl N-terminus and an amide C-terminus. Hydrophobic Index values across 3 epitopes 
20 (n = 3) are shown below. 

Figure 8 provides the epitope configuration and amino acid sequences for HCV 
polyepitope polypeptides, HCVa and HCVb. Numbers above epitopes represent 
hydrophobicity values for each epitope calculated with Pinsoft 2 software, specifying an 
acetyl N-terminus and an amide C-terminus. Hydrophobic Index values across 3 . epitopes 
25 (n = 3) are shown below. 

Figures 9 and 10 show Coomassie stained SDS-PAGE gels and also immunoblots 
from an expression time course of constructs HIVa, HTVb, HCVa and HCVb. Figure 9 - 
Panel A: 1 hour timepoint Lane 1) Novex SeeBiue+2 MW markers; 2) Negative vector/host 
control; 3) HIVa uninduced; 4) HIVa induced; 5) HIVb uninduced; 6) HIVb induced; 7) 
30 HCVa uninduced; 8) HCVa induced; 9) HCVb uninduced; 10) HCVb induced. Panel B: 2 
hour timepoint. Lane 1) Negative vector/host control; 2) Novex SeeBlue+2 MW markers; 3) 
HIVa uninduced; 4) HTVa induced; 5) HIVb uninduced; 6) HTVb induced; 7) HCVa 
uninduced; 8) HCVa induced; 9) HCVb uninduced. Figure 10 - Panel A: 3 hour timepoint 
Lane 1) Negative vector/host control; 2) HIVa uninduced; 3) HIVa induced; 4)HIVb 
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uninduced; 5) HlVb induced; 6) Novex SeeBlue+2 MW markers; 7) HCVa uninduced; 8) 
HCVa induced; 9) HCVb uninduced; 10) HCVb induced. Panel B: Overnight timepoint. 
Lane 1) Negative vector/ host control; 2) HIVa uninduced; 3) HIVa induced; 4)HTVb 
uninduced; 5) HIVb induced; 6) HCVa uninduced; 7) HCVa induced; 8) HCVb uninduced; 
5 9) HCVb induced; 10) Novex SeeBlue+2 MW markers. 

Example 1: EBV polyepitope fusions as vaccine candidates. 

MATERIALS AND METHODS 
Epitope sequences 

10 The 26 CTL epitopes for inclusion in an EBV vaccine, the proteins from which they 

originate and HLA type are shown in Table 1. 
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TABLE 1. CTL epitopes included in the EBV polytopes 



HLA Type 


EBV Protein 


Epitope 


A2 


LMP2 


CLGGLLTMV (SEQ ID NO:6) 




BMLF1 


GLCTLVAML (SEQ ID NO:7) 




EBNA6 


LLDFVRFMGV (SEQ ID NO:8) 




LMP1 


YLLEMLWRL (SEQ ID NO:9) 




LMP1 


YLQQNWWTL (SEQ ID NO:10) 


A3 


EBNA3 


RLRAEAQVK (SEQ ID NO:ll) 


All 


LMP2 


SSCSSCPLSKI (SEQ ID NO:12) 


A23 


LMP2 


PYLFWLAAI (SEQ ID NO:13) 


A24 


LMP2A 


TYGPVFMCL (SEQ ID NO:14) 




EBNA4 


TYSAGIVQI (SEQ ID NO:15) 


B7 


EBNA3 


RPPIFIRRL (SEQ ID NO:16) 


B8 


EBNA3 


FLRGRAYGL (SEQ ID NO:17) 




EBNA3 


QAKWRLQTL (SEQ ID NO:18) 




BZLF1 


RAKFKQLL (SEQ ID NO:19) . 


B27 


EBNA4 


HRCQAIRKK (SEQ ID NO:20) 




EBNA6 


RPJYDLIEL (SEQ ID NO:21) 


B35 


EBNA4 


AVLLHEESM (SEQ ID NO:22) 




EBNA3 


YPLHEQHGM (SEQ ID NO:23) 


B44 


EBNA6 


EENLLDFVRF (SEQ ID NO:24) 




. EBNA6 


EGGVGWRHW (SEQ ID NO:25) 




EBNA4 


VEITPYKPTW (SEQ ID NO:26) 


B46 


EBNA3 


VQPPQLTLQV (SEQ ID NO:27) 


B57 


LMP1 


IALYLQQNW (SEQ ID NO:28) 


B58 


EBNA4 


VSFIEFVGW (SEQ ID NO:29) . 


B60 


LMP2 


IEDPPFNSL (SEQ ID NO:30) 


B62 


EBNA4 


GQGGSPTAM (SEQ ID NO:31) 
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Design/ Ordering of epitopes 

The following method was used to generate ordered arrangements of CTL epitopes 
to produce a polyepitope sequence with favourable hydrophobicity characteristics: 

5 (i) The hydrophobic value for each epitope was calculated using a suitable algorithm 
(ie Pinsoft 2 from Mimotopes Pty Ltd, specifying the N-terminus as N-acetyl and the C- 
terminus as carboxy amide). 

(ii) The set of epitopes was ranked in order of decreasing hydrophobicity. 

10 

(iii) The rank ordered set of epitopes was divided into 3 equal groups (ie group 1 = most 
hydrophobic, group 2 = middle hydrophobicity and group 3 = least hydrophobic (most 
hydrophilic)). Residual epitopes (ie 2 epitopes left over after the set of 26 was divided by 
3), were included in the most hydrophilic group. 

15 

(iv) Triplets of epitopes were created by taking the most hydrophilic of group 2 (ie last . 
in group 2), then the most hydrophobic epitope (ie number 1 in group 1) and lastly the 
most hydrophilic (ie last in group 3). This was continued until all epitopes in groups 1 and 
2 had been used (nb "Leftover" epitopes were added to the C-terminal end of the final 

20 sequence arrangement). 

(v) The triplets were then arranged into a sequence in the order in which they were 
produced (ie Triplet 1 - Triplet 2 - Triplet 3 - etc). 

25 (vi) The hydrophobicity of this triplet arrangement was then plotted using a suitable 
algorithm (ie Fauschere and Pliska). 

(vii) If and where necessary, relocating triplets from areas of low hydrophobicity into 
areas of high peak hydrophobicity in order to reduce hydrophobic amplitude. 

30 

(viii) Re-calculating the hydrophobicity plots and continuing, if necessary, to shuffle 
triplets as in the step (vii) above. 
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(ix) Any residual epitopes (ie least hydrophilic of group 3) can be placed C-terminally 
in the final sequence arrangement or can be used to further reduce iocal peaks in 
hydrophobicity by inserting them adjacent to epitopes of peak hydrophobicity, according 
to a hydrophobicity plot of the assembled triplets. 

(x) Any affinity tags (usually hydrophilic, eg a hexa-histidine sequence) should be 
located either N- or C- terminally or preferably C-terminally if the construct is a fusion 
protein. 

(xi) Confirmation of satisfactory HI. 

Using this process, EBV polyepitope configurations PT26A and PT26B were created. 
The example of PT26 A is shown in Table 4. 

Hydrophobic Index (HI) calculations 

HI values for favourable configurations (PT26A, PT26B) were calculated according 
to the mathematical expression:- 

e=m+n~l 
e=m 

(where m = group number, n = group size, e = integer (epitope) number, x - epitope 
hydophobicity value) over values of n from 2 to 5. 

Preparation of recombinant proteins 

DNA sequences encoding the polyepitopes (PT26A and PT26B) were generated 
from synthetic oligonucleotides using a Splicing by Overlap Extension technique (SOE) as 
described by Horton et al (1995). The codon usage was optimised for R coli expression 
(Wada et al 1992). The polyepitopes were tagged at the C-terminus with a hexa-histidine 
tag for protein purification and detection. The DNA was subcloned into pET28b 
(Novagen) and transformed into E. o?//BL21(DE3) cells (Novagen) for expression. 

A fragment corresponding to the N-terminal region (amino acid residues 21 - 447) 
of EBV gp350 was amplified from plasmid DNA containing the full length gp350 sequence . 
by PCR using the following oligonucleotides: 

5' AGGGATCCCATGGAAGATCCTGGTTTTTTC 3' (forward) (SEQ ID NO:32) and 
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5' TCTAGAGGTCGACACCTGTCGTTGTATTGGG 3' (reverse) (SEQ ID NO:33). This 
DNA fragment was subcloned into pET28b (Novagen) containing the polyepitope insert, 
resulting in an in-frame fusion between gp350 and the polyepitope polypeptide. The 
constructs were referred to as gp350/PT26A (Figure 3A) and gp350/PT26B (Figure 3B). 
5 For protein expression testing, transformed cells were grown in 50mg/ml 

Kanamycin containing L broth at 37°C until OD600 reached 2. Protein expression was 
induced by the addition of IPTG (0.25mM) and cells were grown for another 3 hours. Cells 
were pelleted and boiled in SDS sample buffer before analysing by SDS-PAGE. 

For protein purification, cells induced by IPTG were pelleted, resuspended in 
10 binding buffer (20mM Tris-HCl pH 7.9, 0.5M NaCI, 5mM imidazole) and then sonicated. 
Inclusion bodies were pelleted and washed in buffer. The proteins were solubilised 
overnight in binding buffer containing 8M urea and purified on a Ni ++ -NTA column. 

Preparation of ISCOM™ formulations 

15 ISCOMATRIX™-adjuvant was prepared by combining adjuvant components in a 

formulation vessel. Cholesterol, 1,2, dipalmitoyl phosphatidylcholine (DPPC), and 
ISCOPREP™ as a source of purified Quillaja saponins, were mixed in a weight ratio of 1:1:5 
in the presence of the detergent Mega-10 (United States Patent No. 5,679,354) at a 
concentration of 2%. The detergent was removed by diafiltration with PBS and the 

20 formation of ISCOMATRIX™ confirmed by negative contrast electron microscopy 

revealing complexes including cage-like structures typically with a diameter of 40nm. 
ISCOM™-adjuvanted vaccines were prepared by mixing the EBV polyepitope antigen with 
preformed ISCO!VLATRIX TM -adjuvant, which was prepared as described below: The dose 
strength of ISCOM-adjuvant as saponin was quantified by reverse phase HPLC assay. 

25 ISCOM™ vaccines were prepared by gentle mixing at 22°C of an equal volume of 

2x final dose strength ISCOMATRIX™ with an equal volume of 2x final dose strength EBV 
polyepitope antigen (gp350-PT26A and B). After 60 minutes, the formulation was 
subjected to extensive dialysis, in order to remove urea, at 4°C into PBS buffer pH6.2 using 
12,000 molecular weight cut off dialysis membrane (Cellu Sep T3, San Antonio Texas). 



Mouse immunogenicdty 
Dosing 
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Female HLA A2 transgenic C57B1/6 mice (HDD) were bred at Queensland Institute 
of Medical Research (QIMR, Brisbane, Australia) and immunised at 5-7 weeks of age. Mice 
were housed in filter-topped cages in the PC3 animal facility at QIMR. Groups of 4 or 5 
mice were dosed sub-cutaneously at the tail base with 0.1ml formulation. This was 
5 followed by removal of spleens at day 21 for ex- vivo ELISPOT assay (below). 

Mice dosed sub-cutaneously, received lOjxg ISCOM™-adjuvant (as saponin) and 
lO^xg EBV polyepitope antigen. For a vaccine control group, mice were dosed sub- 
cutaneously with a peptide mixture comprising free peptides (Mimotopes Pty Ltd) 
formulated with tetanus toxoid and Montanide ISA 720 (SEPPIC, Paris, France) as . 
10. previously described (Elliot et al, 1999). Peptide control immunisations come from two 
groups of mice, one group immunised with GLCTLVAML(SEQ ID 

NO:6)/YLLEMLWRL(SEQ ID NO:9)/LLDFVRFMGV(SEQ ID NO:8) peptide mixture and 
the other YLQ/ CLG. A mixture of all five epitopes showed some insolubility problems. 

15 CTL activity (ex-vivo ELISPOT) 

Ex vivo ELISPOT measures/ quantitates both effector and memory CTL which 
secrete IFNy. Peptide-specific IFNy secreting cells are enumerated by an enzyme linked 
immuno-spot (ELISpot) assay modified from Murali-Krishna et al. Flat bottomed 96-well 
microtitre plates are coated overnight with 5 ug/ mL of rat anti-mouse IFNy antibody (clone 

20 RA-6A2, BD PharMingen, San Diego, California, USA). Coated plates are then blocked for 
1 hour with 1% FBS in PBS, and then washed three times with PBS/0.05% Tween 20 (PBS- 
T), and incubated for 1 hour at 37°C with medium comprising RPMI 1640, supplemented 
with 100 Jig/ml streptomycin and 100 IU/ml penicillin, 10% FBS and 10~ 5 M 2- 
mercaptoethanol. Mouse splenocytes were then treated with red blood cell lysis buffer, 

25 washed and resuspended to 1 x 10 7 cells /ml in medium, for use in the ex vivoWNy 

ELISPOT assay. Splenocytes (1 x 10 6 / well) are then placed in the first wells of the ELISpot 
plate and serially diluted two fold. Recombinant human IL-2 (kindly provided by Cetus 
Corp., Emeryville, California, USA) is then added to the plate at a final concentration of 5 
IU/well together with EBV peptide (Mimotopes Pty Ltd) at a final concentration of 100 

30 |xg/ml. Media containing IL-2 without peptide is added to control wells. The final volume 
in each well is 100 pi. Plates are incubated at 37°C in 5% CO z for approximately 18 hours. 
After incubation, cells are lysed by rinsing the plates in H 2 0 and then washed twice in PBS- 
T. Biotinylated anti-mouse IFNy antibody clone XMG1.2 (BD Phar Mingen) is diluted 1:500 



1 
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(2 Mg/ml final concentration) in PBS-T/5% FBS and added to all wells at 50 ^I/well and 
incubated for 2 hours at RT. Plates are then washed in PBS-T and streptavidin-alkaline 
phosphatase, diluted 1:400 in PBS-T/5% FBS, is added at 50 jol per well and incubated for a 
further 2 hours. After washing, plates are developed by adding Sigma Fast BCIP/NBT 
substrate at 50 ul/ well. Plates are incubated at 37°C for approximately 20 minutes to allow 
colour development, and then washed with water to stop the reaction. IFNy specific spots 
are counted using KS ELISPOT Reader (Zeiss). 

RESULTS 
Epitope Fusions 

26 EBV CTL epitopes were selected to provide >90% human population coverage 
for a vaccine formulation. 

In order to link these CTL epitopes (Table 1) together and facilitate the design of a 
polyepitope antigen to form the basis of an EBV prophylactic vaccine, the hydrophobicity 
value of each epitope was calculated using Pinsoft 2 software (Table 2). Two versions of 
26 epitopes were then ordered into configurations PT26A and PT26B (Figure 1), which 
reduced peak hydrophobicity and hydrophobic sequence length (Figure 2). When these 
constructs were cloned for expression in E coli it was found that only one of the 
configurations (PT26A) was able to produce a polyepitope polypeptide (Figure 4A). PT26B 
was not produced (Figure 4C). 

To identify a potential reason for this unexpected finding, local areas of high 
hydrophobicity were examined by summation of overlapping hydrophobicity values 
(Pinsoft 2, Mimotopes Pty Ltd) to provide a hydrophobicity index (HI) over varying 
numbers of peptides in a group (n). For n=2, no correlation was apparent. However, for 
n=3 and n=4 (Table 3), the highest HI values for the expressed sequence (PT26A) were 
lower (1.79 and 2.51 respectively) than the highest values obtained for the non-expressed 
sequence (2.02 and 2.54 respectively). For n=5, again no significant differences were seen. 
This would indicate that there were local areas of slightly higher hydrophobicity in PT26B 
than PT26A. 

The above analysis can be represented by the mathematical expression:- 

e=m+n-l 

e=m 
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(where m = group number, n = group size, e = integer (epitope) number, x = epitope 
hydophobi city value). 

Overall, the results with n=3 and n=4 appeared to show the most distinctive 
differences. This would make the prediction that to be able to express linked, random, 
short amino acid sequences in E coh 'in SDS-PAGE detectable amounts, the hydrophobic 
index over groups of three epitope sequences would preferably need to be less than 1.8 (HL 
< 1.8) and/ or that over groups of four the hydrophobic index would preferably need to be 
less than 2.5 (HI 4 < 2.5), where x was calculated using Pinsoft 2 (Mimotopes Pty Ltd) and 
specifying the N-terminus as N-acetyl and the C-terminus as carboxy amide. Different cut- 
off values will be obtained with different hydrophobicity algorithms. 

Table 4 shows the HI values over 3 peptides (n=3) for 15 random arrangements of 
the 26 EBV CTL epitopes that were generated and analysed by calculating the HI. This 
shows that all 15 random configurations contain multiple HI values which are all in the 
range predicted to preclude the production of a recombinant polypeptide in E coli (ie HI 
greater than 1.8). This also shows that the arrangements made in accordance with the 
present methods are unlikely to be arrived at without application of the insights embodied 
in the present invention. 
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Fusions of these polyepitopes with the N-terminal 400 amino acids of a naturally-occurring 
EBV protein (gp350). 

Provision of CD4 help has previously been shown to improve CTL induction (Thuy 
et al, 2001) and the EBV structural protein gp350 was identified as the preferred candidate 
to provide this property because it would provide cognate help. Hence the two 
polyepitopes, PT26A and PT26B, the latter of which was unable to be expressed in R coli, 
were cloned onto the C-terminus of the N-terminal 400 amino acids of gp350 (Figure 3) and 
expressed as the fusion protein in £. coli. Both of these fused polypeptides were well- 
produced, being clearly visible on a Coomassie-stained gel above the background of R coli 
proteins (profile for PT26A shown in Figure 4B). 

CTL activity 

Vaccination of HLA A2 transgenic mice with either gp350-PT26A-ISCOM™ vaccine 
or gp350-PT26B~ISCOM™ vaccine induced a CTL immune response (IFN ELISPOT) to all 
five A2 epitopes (Figure 5), thus indicating that all A2 epitopes were properly processed 
and presented to the immune system. 

Example 2: EBV polvepitope fusions as nasopharyngeal carcinoma vaccine 
candidates. 

MATERIALS AND METHODS 
Epitope sequences 

The 20 CTL epitopes for inclusion in an EBV-NPC vaccine, the proteins from which 
they originate and HLA type are shown in Table 5. 

Design/Ordering of epitopes 

An ordered arrangement of CTL epitopes to produce a polyepitope sequence with 
favourable hydrophobicity characteristics was generated by the method described in 
Example 1 (i) - (xi). 

The final shuffle involved taking YPLHEQHGM (SEQ ID NO:23) from position 3 
and changing with CLGGLLTMV (SEQ ID NO:16) at position 19 to reduce a high 
hydrophobic index (HI) which resulted from summing epitopes 18, 19 and 20. 

The ordering process for this optimised EBV-NPC polyepitope (EBV-NPCa) 
configuration is shown in Table 6. 
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Hydrophobic Index (HI) calculations 

HI values for EBV-NCPa were calculated according to the mathematical expression:- 

e=m+n-l 
HI m -2x e 
e=m 

(where m = group number, n = group size, e = integer (epitope) number, x = epitope 
hydophobicity value) over values of n from 2 to 5). 

Generation of DNA constructs 

The amino acid sequence of the EBV-NPC polyepitope was back translated to DNA 
using the Dnastar Editseq software (DNASTAR Inc, Madison, Wisconsin, USA) and codons 
optimised for if. coli expression. A C-terminal hexa-histidine tag was incorporated for 
purification and detection. The DNA encoding the polyepitope was generated from 
synthetic oligonucleotides using a Splicing by Overlap Extension technique (SOE) as 
described by Horton et al (1995). This was cloned into pET24b (Novagen) and the resulting 
construct sequenced to ensure that no errors were present (Big Dye Terminator Kit V3.1; 
Applied Biosystems). For expression purposes, DNA was transformed into E coli 
. BL21(DE3) cells (Novagen). 

Protein Expression and Analysis 

Tranformed cells were grown at 37°C in Terrific Broth containing 50mg/ml 
Kanamycin. At an OD600 of -2, protein expression was induced by addition of 0.5mM 
IPTG and samples taken at 1 hour ,2 hours, 3 hours and overnight post-induction. Cells 
were pelleted, resuspended to equal densities and boiled in SDS sample buffer prior to 
analysing by SDS-PAGE on Novex 4-20% Tris-Glycine gels. Gels were analysed both by 
Coomassie Blue staining and immunoblotting. Blots were probed with Dianova anti hexa- 
histidine monoclonal antibody. 
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TABLE 5: CTL epitopes included in the EBV-NPCa polyepitope 



. HLA Type 


EBV Protein 


Epitope 


All 


LMP2 


bbCSbCrLbKl (.bbQ ID NO: 12) 


A23, A24 


LMP2 


T*)\/T THT A TT A A T / PT?/^ TT""\ MA 1 0\ 

PYLFWLAAI (SEQ ID NO: 13) 


A24 


LMP2 


TYGPVFMCL (SEQ ID NO:14) 


A25 


LMP2 


T j~k i-pxTrrrr T O A TA7 /Ct?A TTN \TA 0/*"\ 

VMSNTLL5A W (SEQ ID NO:36) 


A2.1 


LMP2 


CLGGLLTMV (SEQ ID NO:6) 


A2.3 


LMP2 


T T C A TA7TT HP A /PT7A TT\ MA ory\ 

LLSA WILT A (SEQ ID NO:37) 


A2.6 


LMP2 


T T 1 A /""■'CT TT7T /OTA TT"\ \TA.OO\ 

LTAGFLIFL (SEQ ID NO:38) 


B8 


LMP2 


CPLSKILL (SEQ ID NO:39) 


B27 


LMP2 


RlxRWIvRLTV (SEQ ID NO:40) 


B40 


LMP2 


tiii >m ix TO T /PPA TT*\ "X r\f\\ 

IEDPPFNSL (SEQ ID NO:30) 


A2, A68, A69 


LMP1 


YLLEMLWRL (SEQIDNO:9) 


A2 


LMP1 


YLQQNWWTL (SEQ ID NO:10) 


A2 


LMP1 


A T T X TT \/Pr» A /PPA 1 1 *\ X T/""\ /t1 \ 

ALLVLYSFA(SEQ ID NO:4l) 


B57, B58 


T X /CT">1 

LMP1 


TAT V/T /^v/TNX TT A 7 /C~T?f~\ TT^ X TA..OON 

IALYLQQNW (SEQ ID NO:28j 






"PT P A T? A 0\7T^ fC.T70 TT) XTTV1 1 ^ 
lxLlvfTJ}/\V^ V ix v DJl1s^ XU In \-J. XX) 


B8 


EBNA3 


QAKWRLQTL (SEQ ID NO:18) 


B35 


EBNA3 


YPLHEQHGM (SEQ ID NO:23) 


B27 


EBNA4 


HRCQAERKK (SEQ ID NO:20) 


B62 


EBNA4 


GQGGSPTAM (SEQ ID NO:31) 


B8 


BZLF1 


RAKFKQLL (SEQ ID NO:19) 
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Example 3: HIV polvepitope fusions as vaccine candidates. 

MATERIALS AND METHODS 
Epitope sequences 

The 26 CTL epitopes for inclusion in a HIV vaccine, the proteins from which they 
originate and HLA type are shown in Table 7. 

Design/ Ordering of epitopes 

An ordered arrangement of CTL epitopes to produce a polyepitope sequence with 
favourable hydrophobicity characteristics was generated by the method described in 
Examplel (i) - (xi). At the same time, five random sequences were generated and one of 
these (HTVb) was taken for. comparison with the optimised, sequence. 

The ordering process for the optimised HIV polyepitope (HIVa) configuration is 
. shown in Table 8. 

Hydrophobic Index (HI) calculations 

HI values for both configurations (HIVa and HlVb) were calculated according to 
the mathematical expression:- 

e=m+n-l 
HI m = 2x e 
e=m 

(where m = group, number, n = group size, e = integer (epitope) number, x = epitope 
hydophobicity value) over values of n from 2 to 5). 

Generation ofDNA constructs 

Each polyepitope amino acid sequence was back translated to DNA using the 
Dnastar Editseq software and codons optimised for E coli expression. C-terminal hexa- 
histidine tags were incorporated for purification and detection. The DNA encoding the 
polyepitopes was generated from synthetic oligonucleotides using a Splicing by Overlap 
Extension technique (SOE) as described by Horton et al (1995). These were cloned into 
pET24b (Novagen) and the resulting constructs sequenced to ensure that no errors were 
present (Big Dye Terminator Kit V3.1; Applied Biosystems). For expression purposes, 
DNA was transformed into E coli BL21(DE3) cells (Novagen). 
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Protein Expression and Analysis 

Tranformed cells were grown at 37°C in Terrific Broth containing 50mg/ml 
Kanamycin. At an OD600 of ~2, protein expression was induced by addition of 0.5mM 
IPTG and samples taken at 1 hour, 2 hours, 3hours and overnight post-induction. Cells 
were pelleted, resuspended to equal densities and boiled in SDS sample buffer prior to 
analysing by SDS-PAGE on Novex 4-20% Tris-Glycine gels. Gels were analysed both by 
Coomassie Blue staining and immunoblotting. Blots were probed with Dianova anti hexa- 
histidine monoclonal antibody. 

RESULTS 
Epitope fusions 

26 HIV CTL epitopes were selected to provide the components of a vaccine 
formulation. In order to link these CTL epitopes (Table 5) together and facilitate the design 
of a polyepitope antigen to form the basis of a HTV vaccine, the hydrophobicity value of 
each epitope was calculated (as in the case of Example 1) using Pinsoft 2 software. Two 
versions of the 26 epitopes were then created (Figure 7), one which reduced peak 
hydrophobicity and hydrophobic sequence length (HlVa) and another in whicb the 
epitopes were randomly arranged (HIVb). These constructs were then assembled and 
cloned into Ecoli for a comparison of their ability to produce a polyepitope polypeptide. 

HIVa (optimised polyepitope) 

Good expression of polyepitope polypeptide at the predicted MW of 29kDa was 
observed on Coomassie stained gels from the 1 hour time point and the protein was 
recognised by an anti hexa-histidine monoclonal antibody on the immunoblot (Figures 9 
and 10). Maximum expression was reached by 1 hour post-induction. Leaky, uninduced 
expression was seen at each time point, gradually increasing to induced levels after 
overnight incubation. A potential dimer was visible only by immunoblot after overnight 
induction, with some higher molecular weight material also being detected at 3 hours and 
overnight post-induction timepoints. 

HTVb (random polyepitope) 

No polyepitope polypeptide was detected on the stained gel or by immunoblotting 
(Figures 9 & 10). 
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DISCUSSION 

Use of the above algorithm to order the 26 CTL epitopes has allowed the expression 
of an HIV polyepitope polypeptide in good yields. On the other hand, expression of a 
polyepitope sequence which was randomly ordered failed to yield any detectable product. . 
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TABLE 7: CTL epitopes included in the HIV polytopes 



HLA Type 


HIV Protein 


Epitope 

JL IT 


A3 


pl7 


RLRPGGKKK (SEQ ID NO: 42) 


A*2402,A24,A23 


pl7 


KYKLKHIVW (SEQ ID NO: 43) 


B35 


pl7 


WASKELERF (SEO ID NO- M) 


B7,B8 


pl7 


RPGGKKKYKL (SEO ID NO- 4S , 


A2,A*0202 / B62 


pl7 


SLYNTVATL (SEO ID NO- 46") 


B8,B60 


pl7 


EIKDTKEAL (SEO ID NO- 47) 


A2,A*0202 


p24 


TLNAWVKW (SEO ID NO- 48) 


B7,B42,B53 


p24 


TPODLNTML (SEO ID NO- 491 


A2 


p24 


MTNNPPIPV (SEO ID NO- 50) 


B27 


p24 


KRWELGLNK (SEO ID NO- 51) 


- A28, A74 


Protease 


VTLWORPLV (SEO ID NO- 52) 


A2, A*0201 


Protease 


LVGPTPVNI (SEO ID NO- 5^) 


A2,A0201 


Reverse Transcriptase 


ALVEICTEM (SEO ID NO- 54) 


A2,B51 


Reverse Transcriptase 


YTAFTIPSI (SEO ID NO- 55) 


A3,AH 


Reverse Transcriptase 


AxEOSSMTK (SEO ID NO- 56) 


A2,A*0201 


Reverse Transcriptase 


VIYOYMDDL (SFO TD NO- 57) 


A2,B60 


Reverse Transcriptase 


KIEELROHL (SFO TD NO- 58) 

<xX_UL-»X-< XjXXV^X xJ— i ^lJXjV^ XX-/ iNVJ, \JO J 


A2 


Reverse Transcriptase 


ILKEPVHGV (SEO ID NO- 591 


All 


Reverse Transcriptase 


TYOEPFKNTLK (SEO ID NO- 60) 


B44 


Reverse Transcriptase 


TWETWWTEYW (SEO ID NO- 61 ) 

X w T Xl X I T FT X Xl X T V y±J±*»\£ JIL/ X N V»/ • W X / 


A2 


Reverse Transcriptase 


PLVKLWYQL (SEQ ID NO: 62) 


B7,A3 


Reverse Transcriptase 


YLAWVPAHK (SEQ ID NO: 63) 


A2,A*0201 


Integrase 


LLWKGEGAV (SEQ ID NO: 64) 


A2,A*0201 


Vpr 


A1TRILQQL (SEQ ID NO: 65) 


B7 


Vpu 


IPIVAIVALV (SEQ ID NO: 66) 


A2,A2.1 


gpl60 


KLWVTVYYGV (SEQ ID NO: 67) 
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Example 4: HCV polvepitope fusions as vaccine candidates. 

MATERIALS AND METHODS 
Epitope sequences 

The 26 CTL epitopes for inclusion in a HCV vaccine, the proteins from which they 
originate and HLA type are shown in Table 9. 

Design/ Ordering of epitopes 

An ordered arrangement of CTL epitopes to produce a polyepitope sequence with 
favourable hydrophobicity characteristics was generated by the method described in 
Examplel (i) - (xi). The final shuffle involved taking CTCGSSDLY (SEQ ID NO:68) from 
position 3 and changing with FLLLADARV (SEQ.ID NO:69) at position. 26 to reduce a high 
hydrophobic index (HI) which resulted from summing epitopes 24, 25 and 26. At the same 
time/five random sequences were generated and one of these (HCVb) was taken for 
comparison with the optimised sequence. 

The ordering process for the optimised HCV polyepitope (HCVa) configuration is 
shown in Table 10. 

Hydrophobic Index (HI) calculations 

HI values for both configurations (HCVa and HCVb) were calculated according to 
the mathematical expression:- 

e=m+n-l 
HI m = Zx e 
e=m 

(where m = group number, n = group size, e = integer (epitope) number, x « epitope 
hydophobicity value) over values of n from 2 to 5). 

Generation of DMA constructs 

Each polyepitope amino acid sequence was back translated to DNA using the 
Dnastar Editseq software and codons optimised for E coli expression. C-terminal hexa- 
histidine tags were incorporated for purification and detection. The DNA encoding the 
polyepitope sequences was generated from synthetic oligonucleotides using a Splicing by 
Overlap Extension technique (SOE) as described by Horton et al (1995). These were cloned 
into pET24b (Novagen) and the resulting constructs sequenced to ensure that no errors 
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were present (Big Dye Terminator Kit V3.1; Applied Biosystems). For expression purposes, 
DNA was transformed into F. co7/BL21(DE3) cells (Novagen). 

Protein Expression and Analysis 

Tranformed cells were grown at 37°C in Terrific Broth containing 50mg/ml 
Kanamycin. At an OD600 of ~2, protein expression was induced by addition of 0.5mM 
IPTG and samples taken at 1 hour, 2 hours, 3 hours and overnight post-induction. Cells 
were pelleted, resuspended to equal densities and boiled in SDS sample buffer prior to 
analysing by SDS-PAGE on Novex 4-20% Tris-Glycine gels. Gels were analysed both by 
Coomassie Blue staining and immunoblotting. Blots were probed with Dianova anti hexa- 
histidine monoclonal antibody^ 

RESULTS 
Epitope fusions 

26 HCV CTL epitopes were selected to provide the components of a vaccine 
formulation. In order to link these CTL epitopes (Table 7) together and facilitate the design 
of a polyepitope polypeptide to form the basis of a HCV vaccine, the hydrophobicity value 
of each epitope was calculated (as in the case of Example 1) using Pinsoft 2 software. Two 
versions of the 26 epitopes were then created (Figure 8), one which reduced peak 
hydrophobicity and hydrophobic sequence length (HCVa) and another in which the 
epitopes were randomly arranged (HCVb). These constructs were then assembled and 
cloned into E colitor a comparison of their ability to produce a polyepitope polypeptide. 

HCVa (optimised polyepitope) 

While induced monomeric polyepitope polypeptide was not visible on Coomassie 
stained gels, protein of the predicted MW of 27.5kDa was detected at 2 and 3 hours post- 
induction when immunoblots were probed with the anti hexa-histidine antibody (Figures 9 
and 10). Over time there was an increasing presence of high molecular weight products as 
a smear, indicating protein aggregation. After overnight induction these high molecular 
weight aggregates were also visible on the Coomassie stained gel (Figure 10). 

HCVb (random polyepitope) 

There was no detection of any induced monomeric polyepitope polypeptide either 
on Coomassie stained gels or by immunoblotting (Figures 9 and 10). Two faint bands were 
detected by immunoblotting after overnight induction, however these were at -24 and 
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30kDa and were clearly not the predicted product of MW 27.5kDa. They most likely 
corresponded to two histidine-rich proteins of Rcoli, rotamase (~23kDa; NCBI Accession 
NP_417808) and a protein of unknown function (~32kDa; NCBI Accession BAA15973). 
Since the former is a peptidyl-prolyl-isomerase involved in protein folding, it is perhaps 
5 not surprising that this should be overproduced in this situation. 

DISCUSSION 

In designing a HCV polyepitope, use of the above algorithm to order the 26 CTL 
epitopes has allowed the polypeptide sequence to be expressed, mainly as an aggregated 
product. On the other hand, expression of the polyepitope sequence which was randomly 
10 ordered failed to yield any detectable product. 
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TABLE 9: CTL epitopes included in the HCV polytopes 



HLA Type 


HCV Protein 


Epitope 


A2 


• NS1/E2 


FLLLADARV (SEQ ID NO: 69) 


A2 


NS4 


YLVAYQATV (SEQ ID NO: 70) 


,A2 


NS5 


RLIVFPDLGV (SEQ ID NO: 71) 


A2 


CORE 


DLMGYTPLV (SEQ ID NO: 72) 


A2 


NS4 


WMNRLIAFA (SEQ ID NO: 73) 


A2 


NS4 


VLVGGVLAA (SEQ ID NO: 74) 


A2 


NS4 


HMWNFISGI (SEQ ID NO: 75) 


A2 


NS4 . 


ILAGYGAGV (SEQ ID NO: 76) 


A2 . 


CORE 


YLLPRRGPRL (SEQ ID NO: 77) 


• A2 


NS1/E2 


LLFLLLAD A (SEQ ID NO: 78) 


A2 


NS3 


YLVTRHADV (SEQ ID NO: 79) 


A3 


CORE 


KTSERSQPR (SEQ ID NO: 80) 


A3 


CORE 


RLGVRATRK (SEQ ID NO: 81) 


A3 


ENV 


QLFTFSPRR (SEQ ID NO: 82) 


A3 


NS1/E2 


RMYVGGVEHR (SEQ ID NO: 83) 


A3 


NS3 


LIFCHSKKK (SEQ ID NO: 84) 


A3 


' NS4 


GVAGALVAFK (SEQ ID NO: 85) 


A3 


' NS4 


VAGALVAFK (SEQ ID NO: 86) 


A3 


NS3 


TLGFGAYMSK (SEQ ID NO: 87) 


B7. 


CORE 


LPGCSFSIF (SEQ ID NO: 88) 


Al 


NS5 


LSAFSLHSY (SEQ ID NO: 89) 


Al 


NS3 


CTCGSSDLY (SEQ ID NO: 68) 


A24 


NS4B 


FWAKHMWNF (SEQ ID NO: 90) 


A31 


NS5 


VGIYLLPNR(SEQIDNO:91) ! 


A2 


NS4 


LLFNILGGWV (SEQ ID NO: 92) 


B7 


NS3 


IPFYGKAI (SEQ ID NO: 93) 
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It will be appreciated by persons skilled in the art that numerous variations and /or 
modifications may be made to the invention as shown in the specific embodiments without 
departing from the spirit or scope of the invention as broadly described. The present 
embodiments are, therefore, to be considered in all respects as illustrative and not 
5 restrictive. 
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